Programming Research Group ALGORITHMIC AND PRACTICAL CONSIDERATIONS FOR DENSE MATRIX COMPUTATIONS ON THE BSP MODEL
نویسنده
چکیده
The Bulk-Synchronous Parallel (BSP) model of computation has been proposed by L.G. Valiant as a unifying and bridging model for the design, analysis and implementation of parallel algorithms that are both scalable and portable, among diverse parallel machines. In this work we present direct BSP algorithms for dense matrix computations such as matrix multiplication, LU decomposition, determinant nding and matrix inversion. Two established distributions of an input matrix among the processors are examined and the particular theoretical and practical merits of each one are discussed. As the BSP model is not just an abstract computational model, it can also be used as a programming paradigm for the implementation of parallel algorithms; towards this direction we implemented the aforementioned algorithms on top of the Oxford BSP Toolset, BSPlib, that provides library support for BSP programming. Our implementations were tested for scalability and portability on a variety of parallel systems that included the SGI Power Challenge, the IBM SP2 and the Cray T3D. Our experimental results support our claims of eeciency, portability and reusabil-ity of the presented algorithms across the named hardware platforms, and reinforce our convictions that the BSP model can also be successfully used as a programming platform or, indeed, as a kind of programming paradigm.
منابع مشابه
Confluence of the BS λ - calculus
The BSλ -calculus, a formal basis for functional languages expressing bulk synchronous parallel algorithms, is presented. It is then shown to be confluent. 1 High-level languages and parallel programming For parallel programming to become as widespread as sequential programming, the languages supporting it should incorporate all the standard abstraction mechanisms including higher-order functio...
متن کاملCommunication-Efficient Bulk Synchronous Parallel Algorithms
Communication has been pointed out to be the major bottleneck for the performance of parallel algorithms. Theoretical parallel models such as PRAM have long been questioned due to the fact that the theoretical algorithmic efficiency does not provide a satisfactory performance prediction when algorithms are implemented on commercially available parallel machines. This is mainly because these mod...
متن کاملSystematic Development of Functional Bulk Synchronous Parallel Programs
With the current generalization of parallel architectures arises the concern of applying formal methods to parallelism, which allows specifications of parallel programs to be precisely stated and the correctness of an implementation to be verified. However, the complexity of parallel, compared to sequential, programs makes them more error-prone and difficult to verify. This calls for a strongly...
متن کاملA Globally Convergent Parallel Algorithm for Zeros of Polynomial Systems
POLYNOMIAL systems of equations frequently arise in solid modelling, robotics, computer vision, chemistry, chemical engineering, and mechanical engineering. Locally convergent iterative methods such as quasi-Newton methods may diverge or fail to find all meaningful solutions of a polynomial system. This paper proposes a parallel homotopy algorithm for polynomial systems of equations that is gua...
متن کاملOptimisation Modelling Innovations
Model development is perhaps the most intellectually challenging part of practical mathematical programming. The mathematical representation of a model is typically very short and understandable. Algebraic modelling languages allow models to be implemented in such a way that they can easily be understood and modified. Having once developed models, practical applications should integrate them in...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997